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Abstract 

Background: Autism spectrum disorders (ASDs) are male-biased and genetically heterogeneous. While sequencing 
of sporadic cases has identified de novo risk variants, the heritable genetic contribution and mechanisms driving the 
male bias are less understood. Here, we aimed to identify familial and sex-differential risk loci in the largest available, 
uniformly ascertained, densely genotyped sample of multiplex ASD families from the Autism Genetics Resource 
Exchange (AGRE), and to compare results with earlier findings from AGRE. 

Methods: From a total sample of 1,008 multiplex families, we performed genome-wide, non-parametric linkage 
analysis in a discovery sample of 847 families, and separately on subsets of families with only male, affected children 
(male-only, MO) or with at least one female, affected child (female-containing, FC). Loci showing evidence for 
suggestive linkage (logarithm of odds >2.2) in this discovery sample, or in previous AGRE samples, were re-evaluated in 
an extension study utilizing all 1,008 available families. For regions with genome-wide significant linkage signal 
in the discovery stage, those families not included in the corresponding discovery sample were then evaluated 
for independent replication of linkage. Association testing of common single nucleotide polymorphisms (SNPs) 
was also performed within suggestive linkage regions. 

Results: We observed an independent replication of previously observed linkage at chromosome 20p13 (P < 0.01), 
while loci at 6q27 and 8q1 3.2 showed suggestive linkage in our extended sample. Suggestive sex-differential linkage 
was observed at 1 p31 .3 (MO), 8p21 .2 (FC), and 8p1 2 (FC) in our discovery sample, and the MO signal at 1 p31 .3 was 
supported in our expanded sample. No sex-differential signals met replication criteria, and no common SNPs were 
significantly associated with ASD within any identified linkage regions. 

Conclusions: With few exceptions, analyses of subsets of families from the AGRE cohort identify different risk loci, 
consistent with extreme locus heterogeneity in ASD. Large samples appear to yield more consistent results, and 
sex-stratified analyses facilitate the identification of sex-differential risk loci, suggesting that linkage analyses in 
large cohorts are useful for identifying heritable risk loci. Additional work, such as targeted re-sequencing, is 
needed to identify the specific variants within these loci that are responsible for increasing ASD risk. 
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Background 

Autism spectrum disorders (ASDs) are a group of neuro- 
developmental conditions characterized by severe social 
impairment that affect 1 in 88 individuals [1]. Genetic 
factors have long been known to contribute significantly 
to ASD risk based on twin studies [2], the sibling recur- 
rence risk [3,4], and elevated rates of comorbid ASD in 
populations with a wide variety of monogenic syndromes 
such as Fragile X or Timothy Syndrome [5,6]. ASDs are 
also known to present heterogeneously across the popula- 
tion of affected individuals, and results from recent 
genetic studies strongly suggest that genetic risk factors 
for ASD are similarly diverse. Copy number variant (CNV) 
and exome sequencing studies of sporadic ASD cases from 
single-incidence ("simplex") families have found numerous 
novel, de novo risk variants [7-13], and no significant signal 
for rare inherited variation. Estimates based on these 
findings project that approximately 1,000 genes are likely 
to contribute to ASD etiology. 

While a highly productive approach for gene discovery, 
the study of simplex families is designed to identify mostly 
the non-inherited genetic component of ASD risk: rare 
variants resulting from de novo mutations, in which 
variants arise in the germ cell and are not carried by the 
mother or father. However, evidence of high heritability 
for ASD [14], high sibling recurrence risk [3,4], and aggre- 
gation of subthreshold ASD-like phenotypes in families 
[15-18] suggest that inherited genetic variation also plays 
a significant role in ASD etiology. 

Additionally, while germline mutations, potentially shared 
between affected siblings, may also plausibly affect ASD risk 
in multiplex families, current evidence suggests that rare de 
novo CNV events are more prevalent among sporadic cases 
than cases from multiple-incidence ("multiplex") families 
[13]. Largely, however, the specifics of the genetic archi- 
tecture of ASDs that differ between simplex and multiplex 
families are unknown. Therefore, studies of familial trans- 
mission to identify regions of genetic linkage in multiplex 
families remain an important approach to identifying 
predisposing genes. 

Another important clue to ASD etiology lies in its 
consistently male-biased prevalence [19]. There is an 
approximately 4:1 male bias, a phenomenon that is likely 
driven, or at least influenced, by the actions of sex-specific 
biological factors, such as sex chromosomes or steroid 
hormones that potentiate and attenuate ASD risk in males 
and females, respectively [20]. Indeed, several ASD and 
intellectual disability risk genes have been identified on 
the X chromosome [6,21], including FMR1 [22], NLGN4X, 
and NLGN3 [23], demonstrating that in some cases ASD 
may be X-linked. However, the proportion of ASD cases 
currently attributable to X-linked variants remains insuffi- 
cient to account for the degree of male bias observed in 
ASD prevalence. The notion of female protective factors 



on a broader scale is supported by the observation of an 
increased proportion of autistic females relative to males 
carrying variants of large effect size, such as large CNVs 
or deleterious single nucleotide variants (SNVs) [8,11,13]. 
However, with few exceptions [24], it is unknown which 
specific autosomal risk variants are differentially penetrant 
by sex, thus contributing to the sex bias in ASD prevalence. 
For example, a greater number of variants may be associ- 
ated with ASD risk in males as compared to the number of 
variants that also, or specifically, confer risk to females. The 
discovery of such sex-differential risk loci would provide 
genetic clues for investigation of the biological mechanisms 
driving the ASD male bias. However, because these signals 
are likely masked by heterogeneity within sex-mixed 
cohorts, stratification of multiplex ASD family cohorts by 
proband sex may facilitate the identification of novel, 
sex-differential loci harboring inherited risk variants. 

Most previous linkage studies of ASD have used relatively 
small samples (<350 families) and markers with coarse 
resolution and incomplete information [25-31], and there 
has been little agreement between studies in the reported 
findings. Furthermore, the larger studies [32,33] combine 
subjects drawn from several diverse populations with 
different ascertainment schemes. Multiplex family samples 
ranging from 109 to 753 families from the Autism Genetics 
Resource Exchange (AGRE) cohort have been previously 
tested for linkage [25-28,33,34], and the linked risk loci re- 
ported by these analyses, with a few exceptions [25,28], 
show little agreement. This may be due to genetic hetero- 
geneity, small sample sizes, or sparse marker coverage. 

In this study, we used a pruned set of single nucleotide 
polymorphism (SNP) markers providing nearly complete 
linkage information on all autosomes and the X chromo- 
some (information content greater than 0.976 for 99.5% 
of regions covered by SNP genotypes) for our analyses 
in the largest available AGRE sample. Specifically, we 
performed non-parametric linkage testing in three stages 
to: 1) identify novel risk loci in a discovery sample of 847 
multiplex families; 2) confirm loci identified in this dis- 
covery sample, or reported by earlier analyses of AGRE 
samples, in an extended sample of 1,008 multiplex families; 
and 3) test for formal replication of genome-wide sig- 
nificant linkage signals in the independent portion of 
families not tested in the corresponding discovery studies 
(Figure 1). 

For all stages, we also applied a stratification approach 
used previously [25,28,32] to identify sex-differential loci, 
now using a sample three times as large as the original and 
a panel of markers that are substantially more informative 
[28]. We then used the family-based association transmis- 
sion disequilibrium test (TDT) on genotyped and imputed 
SNPs within linked regions to identify common variants 
conferring increased risk for ASD. With the addition of 343 
AGRE families beyond those tested in the most recent 
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1 a) Stage 1 discovery study 



Genome-wide linkage analysis 

847 multiplex families from AGRE 

SNP genotypes from lllumina 550K & Omni-1 



1 b) Linkage signals in 



Source 


Subgrp 


Locus 


Peak 
LOD 


Liu et al. (2001) 


ALL 


5p13.1 


2.55 


ALL 


Xq26.1 


2.56 


Yonan et al. (2003) 


ALL 


5p13.1 


2.54 


ALL 


11p13-11.2 


2.24 


ALL 


17q11.2 


2.83 


Stone et al. (2004) 


MO 


17q11 


4.3* 


FC 


4q32.3-35.1 


2.7 


Cantor et al. (2005) 


MO 


17q21 


3.6* 


McCauley et al. 
(2005) 


ALL 


3p25.3 


2.22 


Szatmari et al. 
(2007) 


MO 


5q12 


2.31 


MO 


9q33.3 


2.36 


ALL 


11p12 


2.77 


FC 


11p12 


2.86 


Weiss et al. (2009) 


ALL 


6q27 


2.94 


ALL 


20p13 


3.81* 



2) Combined 
staqe extension 




study 

• Regional linkage 
analysis in loci with 
suggestive LOD 
scores 

• 1008 multiplex 
families from AGRE 

• SNP genotypes from 
lllumina Omni-1 & 
Omni-2.5 


> 



3) Replication 
study 

• Regional linkage 
analysis in loci with 
genome-wide 
significant LOD 
scores (LOD>3.6) 

• Multiplex families 
from AGRE not 
tested in the 
corresponding 
discovery sample 



* Genome-wide significant linkage 

Figure 1 Schematic of three-stage study design for linkage analyses. Linkage analyses were performed using a three-stage approach, 
(la) A discovery sample (Stage 1) and (lb) previous linkage results were used to identify loci of suggestive linkage (logarithm of odds (LOD) >2.2). 
(2) These suggestive loci were tested for linkage in an extended sample. (3) For loci achieving genome-wide significant linkage (LOD >3.6) in either 
the discovery, extended, or previously published studies, the non-overlapping subset of families was tested for independent replication of linkage. 
AGRE, Autism Genetics Resource Exchange; SNP, single nucleotide polymorphism. 



linkage analysis using subjects from AGRE [33], this com- 
prises the largest linkage study for ASD, and sex-differential 
risk, of families from a single, uniformly ascertained cohort. 
This also allows us to attempt replication of previous 
linkage findings. 

Methods 

Subjects and genotyping 

Subjects were individuals from the AGRE collection 
of nuclear families, including affected probands, their 
unaffected siblings, and their parents. Subjects provided 
written informed consent or, if minors, assent with agree- 
ment from their parents to AGRE for diagnosis and 
blood collection. This study was approved by the Western 
Institutional Review Board (AGRE), the Institutional 
Review Board at Washington University (subject recruit- 
ment, principal investigator: John Constantino), and by 
the Medical Institutional Review Board 3 at the University 
of California, Los Angeles. 

Individuals with study diagnoses of full autism, not quite 
autism, or broad spectrum disorder based on a clinicians 
best estimate given Autism Diagnostic Interview-Revised 
and Autism Diagnosis Observation Schedule scores were 
considered to be affected, in accordance with the inclusive, 
single spectrum concept of ASD as now codified in the 
Diagnostic and Statistical Manual-5. In the AGRE cohort, 
a study diagnosis of not quite autism is given to subjects 
who meet the age of onset criterion but score no more 
than one point short of the autism cutoffs in one or more 
symptom domains (social, communication, restricted and 



repetitive behavior), or subjects who meet autism cutoffs 
in all symptom domains, but do not meet the age of onset 
criterion. A study diagnosis of broad spectrum disorder is 
given to subjects with pervasive developmental disorders 
and varying levels of impairment; these subjects include 
individuals with conditions like pervasive developmental 
disorder not otherwise specified (PDD-NOS) and Asperger's 
syndrome. 

Probands with syndromic autism, significant dysmor- 
phology, documented pre- or peri-natal insult, abnormal 
imaging or medical test, premature birth at less than 35 
gestational weeks, or chromosomal abnormalities were 
not included; additional neuropsychiatric phenotypes in 
parents and unaffected siblings were not applied as inclu- 
sion or exclusion criteria. Here, chromosomal abnormalities 
refer to clinically relevant CNVs identified by karyotype and 
to recurrent CNVs such as 16pll.2 deletions and duplica- 
tions independently reported to AGRE by investigators. 
Subjects with evidence for de novo missense SNVs (n = 5 
cases) [9] were not excluded, due to the current uncertainty 
in determining the effects of missense variation on ASD 
risk. In instances of monozygotic multiples, only one 
proband was selected at random for inclusion. Subjects 
in the AGRE cohort include individuals of Caucasian, 
African- American, Asian, and Hispanic ancestry as noted 
by self-report and multi-dimensional scaling from genotype 
data; subjects were not filtered by ancestry, as the genetic 
analyses used in this study (non-parametric linkage, TDT) 
were family-based and therefore not susceptible to the 
introduction of false positive results from population 
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stratification. However, we note that including multiple 
ethnicities may introduce or exacerbate locus hetero- 
geneity, which is unlikely to falsely inflate logarithm of 
odds (LOD) scores, but instead may reduce power in 
linkage studies [32]. 

Subjects were genotyped in two stages, using DNA puri- 
fied from lymphoblastoid cell lines and obtained from the 
Rutgers University Cell and DNA Repository (Piscataway, 
NJ, USA). Stage 1 consisted of individuals from 1,191 
AGRE families, with subjects from 941 families genotyped 
on the Illumina 550K genome-wide SNP array as de- 
scribed previously [35], and individuals from an additional 
250 AGRE families typed on the Illumina Omni-1 Quad 
array (Illumina, San Diego, CA, USA) in the University of 
California, Los Angeles, Neuroscience Genomics Core. 
Stage 2 consisted of individuals from an additional 396 
AGRE families, 116 of which were genotyped on the 
Illumina Omni-1 array, and 280 of which were typed on 
the Illumina Omni-2.5 array (Additional file 1: Table SI). 
In total, the combined sample of stage 1 plus stage 2 
was comprised of individuals from 1,587 families, 1,008 
of which were multiplex and met inclusion criteria as 
described above. 

Recorded sample identity and pedigree relationships 
were validated by evaluating estimations of identical by 
descent allele sharing across the genome within and 
between families using PLINK software [36]. In cases 
with evidence of identity swaps, where available, geno- 
type data from the Broad Institute (Affymetrix 5.0 
[33]) and the Autism Genome Project (Illumina 1M 
[37,38]) were compared to make a final determination 
of identity. Subjects and SNPs with >5% missing data were 
excluded, and SNPs with Hardy- Weinberg equilibrium 
P- values <0.0000001, minor allele frequency <0.01, and >10 
Mendelian errors were also excluded. Filtered data sets 
from the two stage 1 genotyping platforms were then 
merged using PLINK to generate a stage 1 sample data 
set with all remaining subjects and the union of marker 
sets from the 550K and Omni-1 platforms for a total of 
1,092,577 SNPs. To incorporate stage 2 and build the 
combined sample data set of all genotyped AGRE subjects, 
the union of filtered data from all genotyping platforms 
(550K, Omni-1, Omni-2.5) were merged using PLINK for 
a total of 1,684,432 SNPs. 

Linkage analyses using all families 

Autosomal and X chromosome markers common to all 
platforms and pruned to a linkage disequilibrium r 2 < 0.1 
with PLINK were used for genome-wide linkage analysis 
(stage 1 data: 57,929 SNPs; combined stage data: 53,648 
SNPs). These sets of independent SNPs were mapped to 
genetic positions using Rutgers Combined Linkage-Physical 
maps [39,40] for linkage testing. Non-parametric, multi- 
point linkage was performed genome-wide on all stage 1 



multiplex families meeting inclusion criteria (n = 847 fam- 
ilies) using Merlin (autosomes) and Minx (X chromosome, 
executable option within Merlin) [41], which applies the 
Kong and Cox linear model [42] to test for small increases 
in allele sharing across a large sample of families. 

Linked regions identified in the stage 1 sample or by 
previous reports (discovery samples) from non-parametric 
linkage analyses of AGRE samples using the "broad" af- 
fection status criteria were then identified for further 
evaluation by defining the 2-LOD interval surrounding 
suggestive (LOD >2.2) and significant (LOD >3.6) signals. 
Since the precise physical position of linkage peaks may 
vary between studies using marker sets of differing density 
and information content (see Cantor and colleagues [25] 
for an example), this 2-LOD interval is intended to be 
inclusive and to encompass any underlying variability 
in the location of previous signals. Where there were 
discrepancies between the genetic maps from the original 
studies and our data, the span of the interval was an- 
chored on the current physical position (hgl9) of the 
peak marker. Linkage signals from the study by Szatmari 
and colleagues [32] were reported as Z| r scores, which we 
converted to LOD scores by: LOD = Zf r /(2*ln(10)) [42]. 
Regions with suggestive linkage evidence from stage 1 or 
previous reports were tested for linkage in the combined 
sample. Regions at which the discovery sample showed 
genome-wide significant linkage (LOD >3.6) were then 
evaluated for replication (P < 0.01) within a 2-LOD inter- 
val from the peak LOD by testing only those independent 
families not previously evaluated in the corresponding 
discovery analysis. 

We applied this three-step (1, discovery; 2, extension 
and 3, replication) approach according to guidelines for 
linkage analyses of complex traits outlined by Lander 
and Kruglyak [43]. As preliminary linkage analyses for 
complex traits have yielded suggestive signals, extending 
sample sizes by adding pedigrees may allow these loci to 
reach genome-wide significance. Once a genome-wide 
significant signal has been identified, it may then be 
tested for independent replication in a new sample. 
The platform and stage at which a family was geno- 
typed determined whether they were included in the 
discovery (stage 1) or the extension (combined stage) 
stages (Figure 1). 

Sex-stratified linkage 

To identify sex- differential ASD risk loci, we stratified 
the genotyped, multiplex families into two groups based 
on the sex of their affected children: male-only (MO, no 
affected daughters), and female-containing (FC, at least 
one affected daughter) [28]; the number of affected- 
female-only multiplex families enrolled in AGRE is cur- 
rently too low to analyze these families as their own 
subgroup. In keeping with earlier sex-stratified linkage 
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analyses of AGRE families [25,28,32] and to maintain 
workable sample sizes, all families with two or more 
affected members were assigned by this simple criterion 
to either the MO or FC subset, irrespective of the total 
affected family members or the presence of unaffected 
siblings of the opposite sex. While it can be argued that 
MO families with three or more affected brothers or 
with at least one unaffected sister are more likely to 
carry truly male-specific risk variants (not penetrant in 
females) than MO families with fewer affected brothers 
or no unaffected sisters, the number of available AGRE 
families fitting these criteria is quite low. Specifically, 52 
MO families include three or more affected brothers, 
and 150 MO families have recorded unaffected sisters. 
To avoid further restricting the sample size of our subsets, 
we define the MO and FC subsets simply by the sex of the 
two or more affected siblings, under the assumption that 
the MO and FC subsets are enriched for male-specific and 
female-affecting risk variants, respectively, as compared 
with the full, non-stratified sample. By this definition, the 
stage 1 sample consisted of 487 MO (61%) and 314 FC 
(39%) families, and the combined sample consisted of 602 
MO (60%) and 406 FC (40%) families (Additional file 1: 
Table SI). 

Linkage analyses were performed separately in the two 
subgroups in three steps as described above for the non- 
stratified sample: 1) genome-wide non-parametric link- 
age analysis of MO and FC subgroups from the stage 1 
sample; 2) regional linkage analysis within suggestive 
(LOD >2.2) MO- or FC-specific linkage peaks from 
discovery studies [25,28] using the corresponding MO 
or FC subgroup from the combined sample; and 3) regional 
linkage analysis within peaks of genome-wide significant 
LOD (>3.6) in the independent portion of the correspond- 
ing MO or FC subgroup from the combined sample who 
were not previously tested in the discovery study. 

Additionally, to assess the statistical significance of 
sex-differential linkage signals, a randomization test of 
10,000 subsets of 487 families (matching stage 1 MO 
family N) and 314 (stage 1 FC family N) were analyzed 
for linkage across chromosomes 1, 4, 6, and 8, where 
subgroup-specific suggestive linkage peaks (LOD >2.2) 
were observed. For the combined sample, random subsets 
of 602 (MO family N) were analyzed across chromosome 
1, the only chromosome on which a sex-differential signal 
surpassed LOD 2.2 in the combined sample. On a 
marker-by-marker basis, the LOD score from each of the 
10,000 random trials was compared to the results from the 
corresponding original, subset-specific scan, and the frac- 
tion of random trials for which LOD random > LOD origina i 
was taken as the empirical P-value for stratification, 
reflecting the frequency with which the observed LOD 
magnitude would occur under the null hypothesis of 
no linkage. 



Imputation 

To further improve genotype coverage within linked 
regions for fine-scale association testing, imputation 
was performed separately by data set, as defined by geno- 
typing platform and data collection stage, using IMPUTE2 
[44] and a cosmopolitan reference panel from the 1000 
Genomes Project [45]. Imputed SNPs from each data set 
were then merged using GTOOL [46], and SNPTEST [46] 
was used to generate summary statistics for the merged 
set. Data were filtered to SNPs with an IMPUTE2 quality 
score >0.5, missing data in <5% of subjects, minor allele 
frequency >1%, and Hardy Weinberg P> 0.0000001. The 
final data set included 5,814,564 autosomal SNPs. 

Linkage-directed association testing 

Imputed SNPs within the 2-LOD intervals surround- 
ing linkage peaks exceeding the suggestive threshold 
(LOD >2.2) in either the stage 1 or combined sample 
were tested for association with ASD affection status 
in the family group corresponding to the linkage peak 
(all families (ALL), MO, or FC) using a TDT with adaptive 
gene-dropping permutations (PLINK). Gene dropping 
assignments are applied consistently across siblings to 
control for linkage and appropriately treat trios from 
multiplex families as non-independent [36]. 

In regions of MO or FC linkage signal, extended families 
without multiple affected siblings, but instead with multiple 
affected cousins, were additionally included in the associ- 
ation analysis (total MO n = 606, including 4 additional 
extended families; total FC n = 407, including 1 additional 
extended family). In regions of linkage signal from ALL, 
an additional 5 extended families and 508 families with 
only one genotyped, affected individual were included in 
the association analysis (total ALL n = 1,521 families). All 
additional families met inclusion criteria as described pre- 
viously. Association P-values were adjusted for multiple 
testing according to the number of independent SNPs 
within each region, as defined by a pairwise linkage 
disequilibrium r 2 < 0.3 (PLINK). 

Results 

By testing the largest available, uniformly ascertained 
ASD family sample (n = 1,008 multiplex families) for 
linkage, we aimed to identify novel ASD risk loci not 
identified by the previous, smaller analyses of AGRE 
samples, and/or to confirm the loci reported by these 
analyses. As genotype data were collected in two stages, 
we use the stage 1 sample to identify linked and sex- 
differential risk loci genome-wide, and the combined 
sample (union of stages 1 and 2) to evaluate identified 
loci from stage 1 and earlier reports (discovery studies) for 
confirmation of linkage in the largest available AGRE 
genotype data set. 
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Linkage in all families 

Non-parametric, genome-wide linkage analysis for ASD af- 
fection status in all stage 1 multiplex families (n = 847) iden- 
tified four genomic regions with a peak LOD score >2.2 
(Table 1, Figure 2), the threshold for suggestive linkage for 
a complex trait when allele sharing is tested in sibling pairs 
[43]. We observed the highest LOD score at chromosome 
6q27, with LOD ALL . S ti = 3.22 at rs4708676 (190.611 cM). 
A 2-LOD interval from this peak SNP spans 18.2 Mb, 
31.6 cM, and 100 RefSeq genes (Additional file 2: 
Figure S1A), and the peak SNP is 75 kb upstream from 
gene FRMD1 [GenBank:NM_024919] (0.4-LOD drop 
from peak), a gene with a role in immune function and 
significantly associated with IL-2 secretion [47]. All other 
linkage regions yielded peak LOD >2.2 and are located at 
chromosomes 4ql3.1 (LODall.su = 2.3), 8p21.2 (LOD ALL . 
su = 2.55), and 8ql3.2 (LODall.su = 2.5). 



To determine if the signals observed in the stage 1 
sample and in previous AGRE studies could be improved 
or replicated in a larger sample size, we next carried out 
an extension study by testing these regions for linkage in 
the combined sample of 1,008 multiplex families (union of 
stage 1 and stage 2 samples). Non-parametric linkage ana- 
lysis of the combined sample within the regions of interest 
confirmed three loci above the suggestive threshold of 
LOD >2.2: 6q27 (LOD ALL . Com = 2.50; Additional file 2: 
Figure S1A), 8ql3.2 (LOD ALL . C om = 2.82; Additional file 2: 
Figure SIB), and 20pl3 (LOD ALL . Com = 3.02; Figure 3). Of 
these three loci, only the signal at 8ql3.2 increased in the 
combined sample. Combined sample linkage on other 
chromosomes of interest is shown in Figure 4. 

The strongest signal that we observed from the com- 
bined sample, and the only signal that surpassed a LOD 
score of 3.0, is at 20pl3. This locus had been identified 



Table 1 Summary of significant and suggestive linkage peaks from AGRE samples 

Discovery sample Combined sample (Stage 1 + Stage 2) 



Source 


Number 
of families 


Subgroup 


Locus* 


Peak LOD 


Number 
of families 


Peak LOD 


Peak P-value 


Peak SNP 


Liu etal. (2001) [26] 


110 


ALL 


5p13.1 


2.55 


1008 


0.99 


0.0163 


rs6884342 




110 


ALL 


Xq26.1 


2.56 


1008 


1.12 


0.0116 


rsl 2557711 


Yonan et al. (2003) [34] 


345 


ALL 


5p13.1 


2.54 


1008 


0.99 


0.0163 


rs6884342 




345 


ALL 


1 1 pi 3-1 1 .2 


2.24 


1008 


0.92 


0.0199 


rs2984699 




345 


ALL 


17q11.2 


2.83 


1008 


0.53 


0.0592 


rsl 382779 


Stone et al. (2004) [28] 


148 


MO 


17q11 


4.3 


602 


0.42 


0.0834 


rs4795708 




109 


FC 


4q32.3-35.1 


2.7 


406 


1.10 


0.0121 


rsl 71 7072 


Cantor et al. (2005) [25] 


196 


MO 


17q21 


3.6 


602 


0.47 


0.0706 


rs 1877032 


McCauley et al. (2005) [27] 


158 (85 AGRE) 


ALL 


3p25.3 


2.22 


1008 


0.04 


0.3344 


rs 1400207 


Szatmari et al. (2007) [32] 


741 (211 AGRE) 


MO 


5q12 


2.31 


602 


1.42 


5.34e-3 


rs706725 




741 (211 AGRE) 


MO 


9q33.3 


2.36 


602 


1.47 


4.64e-3 


rs204169 




1181 (387 AGRE) 


ALL 


1 1 pi 2 


2.77 


1008 


0.92 


0.0199 


rs2984699 




440 (176 AGRE) 


FC 


1 1 pi 2 


2.86 


406 


0.38 


0.0925 


rs404977 


Weiss et al. (2009) [33] 


904 (753 AGRE) 


ALL 


6q27 


2.94 


1008 


2.50 + 


3.43e-4 


rs6931082 




904 (753 AGRE) 


ALL 


20p13 


3.81 


1008 


3.02 + 


9.55e-5 


rs6 139007 


Current study Stage 1 sample 


487 


MO 


1 p31 .3 


2.98 


602 


2.55 + 


3.05e-4 


rs7521242 




847 


ALL 


4q1 3.1 


2.30 


1008 


2.14 


8.37e-4 


rs 1483288 




487 


MO 


4q26 


2.41 


602 


1.28 


7.60e-3 


rs2196712 




847 


ALL 


6q27 


3.22 


1008 


2.50 + 


3.43e-4 


rs6931082 




487 


MO 


6q27 


2.86 


602 


2.07 


1 .02e-3 


rs960145 




314 


FC 


8p21.2 


2.67 


406 


1.42 


5.25e-3 


rs7001120 




847 


ALL 


8p21.2 


2.55 


1008 


2.18 


7.60e-4 


rsl 3257637 




314 


FC 


8p12 


2.37 


406 


1.34 


6.51e-3 


rs2976525 




847 


ALL 


8q13.2 


2.50 


1008 


2.82 + 


1 .58e-4 


rs4738003 



*Refer to Additional file 3: Table S2 for locus details, including boundaries and peak markers. 

Bold type indicates logarithm of odds (LOD) scores passing genome-wide significance thresholds of LOD >3.6 for discovery studies, 

+ LOD scores passing suggestive linkage thresholds of LOD >2.2 in the combined sample extension study. AGRE, Autism Genetics Resource Exchange; ALL, all 
families; FC, female-containing; MO, male-only; SNP, single nucleotide polymorphism. 
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Figure 2 Genome-wide linkage: stage 1 sample. Logarithm of odds (LOD) scores from genome-wide, non-parametric, multipoint linkage 
analysis for autism spectrum disorder affection status in the stage 1 sample are plotted for autosomes (Merlin) and chromosome X (Minx). Top 
panel: all multiplex families (ALL); middle: male-only families (MO); bottom: female-containing families (FC). Dashed lines mark LOD thresholds 2.2 
for suggestive and 3.6 for significant linkage [39]. Arrows note signals with LOD >2.2. 



previously in a study that combined 753 AGRE families 
with other families from the National Institute of Mental 
Health repository [33]. Since 665 of the families in that 
analysis overlapped with the current sample, we formally 
tested for replication of this locus by analyzing only 
those families unique to our study (n = 343 multiplex 
families) [33]. We identified a linkage peak within the 
20pl3 region of interest with a P-value of 0.0076 at 
rs2 14828, thus meeting the significance threshold for 
independent replication of linkage for a complex trait 
(P < 0.01) [43]. An additional five nearby SNPs also had 
P- values of less than 0.01 in this independent family 
set. From the peak LOD ALL . Co m of 3.02 at SNP rs6139007 
(1.613 cM) in the combined sample, a 2-LOD drop sup- 
port interval at this locus spans 4.8 Mb, 14.2 cM, and 87 
RefSeq genes (Figure 3). The combined sample peak SNP 
is located just 500 bp upstream from the transcription 
start site of TRIB3 [GenBank:NM_021158] (0.1-LOD 
drop from peak), which encodes a regulator of AKT1 
[GenBank:NG_012188] and is expressed mainly in pan- 
creas, bone marrow, and leukocytes [48,49]. The inde- 
pendent sample peak SNP, rs214828 at 8.403 cM, is 
intronic to gene TGM3 [GenBank:NM_003245] (0-LOD 
drop from peak), which encodes a calcium-dependent 
peptide cross-linking enzyme [50]. 

While signals at 6q27 and 8ql3.2 did not achieve 
genome-wide significance in either stage of analysis, the 
consistency of the signal at these loci across the discovery 
and combined samples suggests these regions may harbor 
ASD risk variants; each encompasses promising candidate 
genes, including SULF1 [GenBank:NM_00 1128206] (0-LOD 
drop from peak at 8ql3.2, Additional file 2: Figure SIB) 



located directly under the 8ql3.2 linkage peak and whose 
protein product interacts with growth factors and cy- 
tokines in cell signaling [51], and PARK2 [GenBank: 
NG_008289] and RPS6KA2 [GenBank:NM_021135] (1.7- 
and 0.6-LOD drop, respectively, from peak at 6q27, 
Additional file 2: Figure S1A), which are both located 
within rare CNVs identified in ASD cases [52-54]. 

Sex-stratified linkage 

To identify sex-differential ASD risk loci, we analyzed 
the multiplex families in two separate groups according 
to the sex of the affected children in each family: MO 
and FC (see Methods) [28]. This sex stratification approach 
has been applied only twice in earlier analyses of exclusively 
AGRE families, in subgroups only one-third as large as 
our stage 1 MO and FC subgroups [25,28]. These earlier 
analyses identified and replicated a genome-wide signifi- 
cant signal at 17qll-q21 in the MO subgroup, which has 
not been subsequently replicated in larger studies [32]. 
Using our larger sample, we aimed to identify additional 
sex-differential risk loci from both the MO and FC 
subgroups. 

Separate non-parametric, genome-wide linkage analyses 
for ASD affection status in the stage 1 MO (n = 487 
families) and FC (n = 314 families) subgroups identified 
five loci with LOD scores >2.2 (Table 2, Figure 2), two 
of which overlap peaks from all families (6q27, 8p21.2) 
and three that are suggestive only in either the MO or 
FC subgroup (lp31.3, 4q26, 8pl2). 

We observed the highest LOD score for the MO sub- 
set at chromosome lp31.3, with LOD M0 .sti = 2.98 at 
rs7521242 (92.905 cM); a 2-LOD interval from this peak 
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Figure 3 Independent replication of genome-wide significant linkage at 20p13. Genome-wide significant linkage signal at 20p13 from the 
combined sample (ALL fams; green, solid line), Autism Genetics Resource Exchange families (AGRE fams) analyzed by Weiss et al. [33] (orange, dashed 
line), and all AGRE families not previously analyzed by Weiss et al. [33] (Ind. Fams; purple, solid line). Top: linkage across the full chromosome 20; middle: 
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SNP spans 10.0 Mb, 137 cM, and 48 RefSeq genes. Ana- 
lysis of this locus in the MO subgroup from the combined 
sample identified a smaller, but still suggestive, peak 
LOD M o.Com = 2.55 also at rs7521242 (Figure 5A). In both 
samples, this MO-specific peak is centered on gene NFIA 
[GenBank:NG_011787] (0-LOD drop from peak), which is 
expressed in the central nervous system and plays a sig- 
nificant role in glial cell fate determination and in normal 
development of the corpus callosum [55,56]. Exome 
sequencing has also identified a de novo, non-synonymous, 
loss of function SNV in an autistic subject in this gene, 
although the SNV carrier is female [7]. 

From the FC stage 1 subset, we observed the highest 
LOD score at chromosome 8p21.2, with LOD FCStl = 
2.67 at rsl0111167 (46.967 cM); a 2-LOD interval from 
this peak SNP spans 6.4 Mb, 9.9 cM, and 73 RefSeq 
genes (Figure 5C). This FC-specific peak is centered on 
gene EBF2 [GenBank:NG_030344] (0-LOD drop from 
peak), which encodes a transcription factor that may act 
alongside WNT1 [GenBank:NG_033141] to regulate cellu- 
lar differentiation during development [57]. However, LOD 



scores in this region from the combined sample FC sub- 
group did not reach the suggestive LOD threshold of 2.2. 
Similarly, the other sex subgroup-specific peaks at chro- 
mosomes 6q27 (LOD M0 .sti = 2.86), 4q26 (LOD M0 .sti = 
2.41), and 8pl2 (LOD FCSt i = 2.37), only exceeded LOD of 
2.2 in the stage 1 sample (Table 1, Table 2). 

We next sought to calculate the likelihood of observing 
the MO and FC signals that surpassed (lp31.3, 4q26, 
8p21.2, 8pl2) or approached (6q27) the magnitude of the 
LOD signal from all families by chance. To assess the sig- 
nificance of this signal enrichment in the sex-defined sub- 
sets, we performed a randomization test for linkage in 
10,000 N-matched subsets of families from the full multi- 
plex family cohort. The empirical P-value indicates subset- 
specific linkage enrichment (empirical P<0.05 at peak 
SNP) at chromosomes lp31.3 (MO) in both the stage 1 and 
the combined samples, and at 8p21.2 (FC), and 8pl2 (FC) 
in the stage 1 sample (Table 2, Figure 5). These results sug- 
gest that the stratification of cohorts by proband sex can re- 
veal sex-differential ASD risk loci. However, only the MO 
locus at lp31.3 was supported in the combined sample. 
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Figure 4 Linkage in regions of interest from previous studies: combined sample. Logarithm of odds (LOD) scores from non-parametric, 
multipoint linkage analysis for autism spectrum disorder affection status in the combined sample (Merlin) are plotted for chromosomes with 
suggestive linkage peaks (LOD >2.2) from previous studies. Colored bars and black marks indicate the spans and peaks of linkage regions of interest, 
respectively; text indicates the linkage region source (L, Liu et al, 2001 [26]; Y, Yonan et al, 2003 [34]; St, Stone et al., 2004 [28]; C, Cantor et al., 2005 
[25]; M, McCauley et al., 2005 [27]; Sz, Szatmari et al., 2007 [32]; W, Weiss et al, 2009 [33]; green and ALL, all multiplex families; blue and MO, male-only 
families; red and FC, female-containing families). Dashed lines mark LOD thresholds of 2.2 for suggestive and 3.6 for significant linkage. 
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Table 2 Sex subset-specific linkage signals 


Locus 


Subgroup 


Stage 1 sample 
peak LOD 


Combined sample 
peak LOD 


Gene at suggestive 
linkage peak 


Sex-differential linkage 
empirical P-value 


1 p31 .3 


MO 


2.98 




NFIA 


<0.01 




MO 




2.55 


NFIA 


<0.0005 


4q26 


MO 


2.41 


1.28 


SYNP02 


NS 


6q27 


MO 


2.86 


2.07 


MLLT4 


NS 


8p21.2 


FC 


2.67 


1.42 


EBF2 


<0.05 


8p12 


FC 


2.37 


1.34 


NRG1 


<0.005 



Bold type indicates significant sex-differential linkage signals with empirical P<0.05. FC, female-containing; LOD, logarithm of odds; MO, male-only; NS, not significant. 



Aside from the signal identified by Weiss and colleagues 
[33] at 20pl3 in a non-sex-stratified sample, the only 
other locus at which genome-wide significant linkage was 
previously observed was in a MO subset at 17qll [28]. 
This signal was subsequently replicated in an additional 
MO subset of AGRE families [25]. In an attempt to further 
replicate the signal at this locus, we tested the 407 MO 
families from the combined stage that were not included 
in either previous analysis. The most significant linkage 
P-value that we observe within the 2-LOD drop interval 
from the peak signal observed by Cantor and colleagues 
[25] (Additional file 3: Table S2) is 0.1003 at rs2014209; 
thus, we do not independently replicate the MO signal 
at 17q in this sample. 

Linkage-directed association 

We next tested for association with common genotyped 
and imputed SNPs within the 2-LOD intervals around 
each linkage peak with LOD >2.2 in the stage 1 or the 
combined sample. For regions with suggestive signals 
from the MO or FC subsets, association testing was run 
on the corresponding subset from the combined stage, 
with the addition of families with multiple affected cousins 
instead of multiple affected siblings. For regions with 
suggestive signals from all multiplex families (ALL), as- 
sociation testing was run on all multiplex families from 
the combined stage with the addition of 508 families with 
only a single genotyped, affected member. While we 
observe some clusters of SNPs that approach significance, 
we did not identify any SNP that survived multiple testing 
correction for the number of independent SNPs within 
each region, defined by pairwise LD r 2 < 0.3. We highlight 
the top association signals here for the sake of interest, 
although we emphasize that none pass our correction 
for multiple comparisons. 

The strongest unadjusted association signal within any 
linkage region occurred in the ALL peak on chromosome 
4ql3.1 at rsll5667468 with a P-value of 2.997 x 10" 5 (after 
regional correction, P = 0.154). This associated SNP is 
located 7.8 Mb from the SNP at the linkage peak and is 
intronic to gene NPFFR2 [GenBank:NM_004885], and 
the minor allele T was found to be under-transmitted to 



affected offspring from ALL families (odds ratio = 0.4143). 
In contrast, the strongest corrected association signal 
occurred in the FC peak on chromosome 8p21.2 at 
rs78485638 with a corrected P-value of 0.052 (unadjusted 
P = 4.206 x 10" 5 ; Figure 5D). This SNP is located 2.7 Mb 
from the SNP at the linkage peak, is intronic to gene 
LOXL2 [GenBank:NG_002318], and the minor allele T 
was found to be under-transmitted to affected offspring 
from FC families (odds ratio = 0.4512). 

Discussion 

To identify and support genomic loci likely to contain 
variants contributing to ASD risk in multiplex families 
from the AGRE collection, we performed linkage analyses 
in all and sex-stratified subsets of multiplex families 
followed by targeted association testing in 1,521 families 
(1,008 multiplex) from the AGRE cohort. The strongest 
linkage signal that we identified was on chromosome 
20pl3, which exceeded a LOD score of 3.0 in our combined 
sample of 1,008 multiplex families. At 20pl3, we also rep- 
licated in an independent sample from the same AGRE 
cohort a previous report of significant linkage at this locus 
[33]. Analyses of sex-defined family subgroups and 
randomization testing for signals from these subgroups 
identified a locus at chromosome lp31.3 that showed 
significant linkage in the MO subgroup in both the stage 1 
and combined samples, and loci at chromosomes 8p21.2 
and 8pl2 that showed significant linkage in the FC 
subgroup only in the stage 1 sample. No genotyped or 
imputed common SNPs within any linked region proved 
to be significantly associated with ASD, an observation 
consistent with a model where the influence of multiple 
loci of very small effect size, or of rare variants, contributes 
to ASD. 

The linkage signal at 20pl3 is especially noteworthy as 
it was the most significant signal that we observed in 
our combined sample with a LOD A i LCom of 3.02, and it 
also meets the criterion for an independent replication 
of genome-wide significant linkage [43]. In addition, as 
both this study and the study by Weiss and colleagues 
[33] were conducted using the AGRE collection, a sample 
that has been uniformly ascertained and evaluated over 
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Figure 5 (See legend on next page.) 



Werling et al. Molecular Autism 2014, 5:13 
http://www.molecularautism.eom/content/5/1/13 



Page 12 of 16 



(See figure on previous page.) 

Figure 5 Significant sex-differential linkage peaks. Regions of sex subgroup-specific suggestive linkage (logarithm of odds (LOD) >2.2) with 
empirically significant signal enrichment from randomization testing are plotted. (A) 1p31.3, MO; top: linkage across full chromosome 1 from all 
stage 1 family groups; upper-middle: linkage from all stage 1 (solid lines) and combined sample (dashed lines) family groups and association 
signal from transmission disequilibrium test (TDT; black dots, EMP1 is empirical P-value from TDT) in male-only (MO) families across 2-LOD interval 
from peak LOD; lower-middle: empirical P-values from test of linkage subgroup specificity (black = stage 1, gray = combined), dotted lines indicate 
thresholds corresponding to empirical P-values of 0.05, 0.01, and 0.005; bottom: RefSeq gene alignment in 2-LOD interval. (B) Linkage across full 
chromosome 8 from all stage 1 family groups. (C) 8p21.2, female-containing (FC), (D) 8p12, FC; for each, top: linkage from all stage 1 family 
groups and TDT association in FC families across 2-LOD interval from peak LOD; middle: empirical P-values from stratification permutation tests; 
bottom, RefSeq gene alignment in 2-LOD interval. 



time, this is a clear replication. A 2-LOD interval from the 
combined sample peak spans such potential risk candidate 
genes as NRSN2 [GenBank:NM_024958] (0.3-LOD drop 
from peak), a gene expressed throughout the cerebral 
cortex, thalamus, hypothalamus, and in Purkinje cells 
[58], and CSNK2A1 [GenBank:NG_011970] (0.5-LOD 
drop from peak), which encodes a protein involved in 
the regulation of circadian rhythms [59]. 

For nearly all linked regions identified in the discovery 
samples, including 20pl3, LOD scores decreased when 
analyzed in the combined sample despite the increased 
sample size, consistent with previous observations of 
genetic heterogeneity [60-62]. However, it is interesting 
to note that those signals that were confirmed or repli- 
cated in the combined or independent sample were ini- 
tially observed by tests of relatively large discovery 
samples. This is because, like association testing, a very 
large number of small pedigrees are critical to identify 
robust linkage signals [63]. In smaller samples, signal 
fluctuations between analyses are caused by extreme 
heterogeneity of genetic risk loci for ASD [60,61,64], 
such that each analysis identifies different, but potentially 
true, risk loci. Since linkage peaks represent deviations 
from expected proportions of affected family members 
sharing two, one, or zero alleles identical by descent at a 
particular locus, linkage analyses for complex traits are 
particularly sensitive to the composition of families 
included in any one test. If, as projected, there are close to 
1,000 risk genes for ASD [12], then the chances that a suf- 
ficient proportion of families in current sample sizes share 
the same risk locus are low. 

We recognize that the power we have to detect linkage 
in our analyses is for complete linkage with loci of modest 
genetic effects on ASD risk [63], and so it is not surprising 
that we do not observe genome-wide significant linkage in 
either the stage 1 or extended, combined stage samples, 
and that most suggestive signals fail to increase in the 
extended sample. However, we emphasize that only now 
are we approaching the sample sizes necessary to detect 
significant linkage in the face of locus heterogeneity in 
cohorts comprised of small pedigrees, and that any possible 
reduction of locus heterogeneity (for example, by testing 
families ascertained by the same cohort as is done here), 
will be key to identify replicable linkage signals. 



Rational stratification of cohorts into subgroups based 
on a shared trait may also facilitate the discovery of risk 
loci by increasing the relative homogeneity of specific 
genetic risk factor(s) in the subgroup. Here, we stratified 
our sample by the sex of the probands within each family 
to identify loci with sex-differential relationships with 
ASD risk, and found significant risk loci at chromosome 
lp31.3 (MO), 8p21.2 (FC), and 8pl2 (FC). At lp31.3 and 
8pl2, linkage signals from the MO and FC subsets, re- 
spectively, were significantly stronger than the signal 
from the full, non-stratified cohort, suggesting that this 
sex-based stratification approach can indeed reduce the 
genetic heterogeneity within each subgroup that would 
otherwise obscure signals at these loci. 

We note that these loci should not be interpreted as 
simply sex-specific. Although we cannot say that the MO 
group is perfectly restricted to families who carry solely 
male-specific variants and therefore only have, and would 
ever have, affected male children, we do assume that the 
MO subgroup is substantially enriched for families who 
carry risk variants that are more penetrant in males. Thus, 
signals specific to the MO subset are more likely to be 
male-specific. In contrast, since the FC group includes 
affected brothers of autistic girls, FC signals are best 
interpreted not as female-specific, but as female-affecting 
risk loci, in accord with a hypothesis that only a subset of 
ASD risk loci are penetrant in females [7,9-13,65]. 

The significant MO and FC signals identified here impli- 
cate regions containing promising candidate genes that 
warrant further exploration by targeted re-sequencing 
(for example, [66-69]). The MO peak at lp31.3 is located 
directly over NFIA [GenBank:NG_0 11787], whose gene 
product has transcription factor activity and has been 
implicated in central nervous system development [55,56]. 
Rare deletions encompassing this gene have been identified 
in subjects with ASD [70], as well as de novo mutations [7]. 
The FC peak at 8p21.2 spans several candidates, including 
STC1 [GenBank:NG_029711] (1.1-LOD drop from peak), 
which encodes a glycoprotein regulated by calcium 
that may act to protect neurons from ischemia and 
hypoxia [71], and neurofilament genes NEFM [GenBank: 
NG_008388] and NEFL [GenBank:NG_008492] (both 1.1- 
LOD drop from peak) whose products likely function in 
transport to neuronal projections [72]. Potentially relevant 
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to sex-differential risk, GNRH1 [GenBank:NG_016457] 
(0.7-LOD drop from peak) is also located within this link- 
age region, and mutations in this gene are likely to affect 
gonadal function [73], perhaps differentially modulating 
downstream manifestation of ASD risk factors in males 
and females. The neighboring FC peak at chromosome 
8pl2 is located directly over NRG1 [GenBank:NG_012005] 
(0-LOD drop from peak), a known schizophrenia risk 
gene [74,75]. 

A similar stratification approach for the identification 
of male-specific and female-affecting ASD risk loci has 
been successfully applied previously by Stone and col- 
leagues [28] to an early iteration of the AGRE cohort, 
as well as by Szatmari and colleagues [32] to a sample 
that included a subset of AGRE families, and Lamb and 
colleagues [30] and Schellenberg and colleagues [29] to 
other ASD family samples. No convergence in linkage 
signals was observed across these studies. We predicted 
that our ability to identify novel, sex-differential ASD risk 
loci in the present study would be aided by a greater than 
three-fold increase in subjects exclusively from AGRE as 
compared to those used by Stone and colleagues [28] and 
by the increase in coverage afforded by dense SNP data in 
lieu of several hundred microsatellite markers. However, 
the results from our sex-stratified analyses do not reach 
genome-wide significance and also do not align with 
findings from earlier sex stratification analyses, including 
Stone and colleagues [28] who reported a linkage peak at 
17qll-q21 in MO families that was subsequently replicated 
[25]. This variability between studies is again consistent 
with extreme risk locus heterogeneity in ASD [76,77], with 
each analysis of a different combination of ASD families 
identifying different linked regions. 

Under the assumption of genetic heterogeneity, for 
complex traits such as ASD, it is possible that the various 
loci identified by analyses of different family sets flag true 
sites for ASD risk in a proportion of the families tested. 
To pursue this, linked loci will need to be investigated 
more closely to identify the precise variants that effectively 
increase ASD risk. This has so far proven challenging, as 
association testing of densely mapped common SNPs 
within linkage peaks has failed to definitively identify 
risk variants, both in the present study and in previous 
work [33]. 

Although our analyses are likely underpowered to 
identify regionally significant associations with common 
variants of small effect size, it is alternatively possible that 
rare variants, not explicitly tested here, contribute to the 
heritable component of ASD risk. For example, rare vari- 
ants, shared between siblings but private to each nuclear 
family, may cluster in the same gene or set of genes. Since 
rare variants are less likely to be tagged by common SNPs, 
they should be more readily localized by allele-agnostic 
linkage analyses than association testing. As in gene 



discovery studies of sporadic ASD cases, sequencing of 
functional genomic features in linked regions will be 
necessary to identify rare variants and evaluate their role 
in familial ASD risk. In either case, larger family-based 
cohorts will be needed to improve power. 

Conclusions 

We conclude that the use of linkage analyses in multiplex 
family cohorts has complementary utility to genome-wide 
association studies for the investigation of the familial, 
inherited contribution to ASD risk. This is especially the 
case in the context of rare variants in human disease (for 
example, [78,79]). Additionally, the use of a sex stratifi- 
cation approach facilitates the identification of risk loci 
that are differentially associated with ASD in families 
with autistic sons versus daughters. However, further 
work is needed to determine which gene(s) or genetic 
features within linked regions, especially at chromosome 
20pl3, replicated here, harbor the variants responsible for 
increasing familial and sex-differential genetic risk for ASD. 
Exploring this in detail via targeted sequencing in large 
cohorts will be necessary to elucidate the common versus 
rare genetic contributions to ASD. 
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across 2-LOD interval from peak LOD; bottom: RefSeq gene alignment in 
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table, respectively. 



Abbreviations 

AGRE: Autism Genetics Resource Exchange; ALL: all families; ASD: autism 
spectrum disorders; CNV: copy number variant; FC: female-containing; 
IL: interleukin; LOD: logarithm of odds; MO: male-only; SNP: single nucleotide 
polymorphism; SNV: single nucleotide variant; TDT: transmission 
disequilibrium test. 

Competing interests 

The authors declare that they have no competing interests. 
Authors' contributions 

DMW carried out the linkage and association studies and drafted the 
manuscript. JKL performed preprocessing and quality control on genotype 



Werling et al. Molecular Autism 2014, 5:13 
http://www.molecularautism.eom/content/5/1/13 



Page 14 of 16 



data and provided assistance with genetic analyses. RL carried out the 
genotype imputation for all samples. RMC advised on the design and 
interpretation of the study and helped to draft the manuscript. DHG 
conceived of the study, participated in its design and interpretation, and 
helped to draft the manuscript. All authors read and approved the final 
manuscript. 

Acknowledgments 

We thank the patients and families whose participation makes this work 
possible, as well as Clara M Lajonchere and Ryan Butler from AGRE and Joe 
DeYoung from the University of California, Los Angeles, Neurosciences 
Genomics Core. Thanks also to Lauren Lawrence and Kun Gao for technical 
assistance. AGRE is a program of Autism Speaks and is supported by grant 
NIMH 1U24MH081810 to Clara M Lajonchere. DMW was supported by NIMH 
F31MH093086. RL was supported by Autism Speaks, fellowship #7436. This 
work was supported by ACE Network grant 5R01 R01MH081754 and ACE 
Center grant 3P50 HD055784 to DHG. 

Author details 

interdepartmental PhD Program in Neuroscience, Brain Research Institute, 
University of California, Los Angeles, 90095 Los Angeles, CA, USA. 
Neurogenetics Program and Department of Neurology, David Geffen School 
of Medicine, University of California, Los Angeles, 90095 Los Angeles, CA, 
USA. 3 Center for Autism Research and Treatment, Semel Institute, David 
Geffen School of Medicine, University of California, Los Angeles, 90095 Los 
Angeles, CA, USA. 4 Center for Neurobehavioral Genetics, Semel Institute, 
University of California, Los Angeles, 90095 Los Angeles, CA, USA. 
department of Human Genetics, University of California, Los Angeles, 695 
Charles E. Young Dr. South, Gonda 2506, 90095 Los Angeles, CA, USA. 

Received: 27 August 2013 Accepted: 24 January 2014 
Published: 17 February 2014 



References 

1. Autism and Developmental Disabilities Monitoring Network Surveillance 
Year 2008 Principal Investigators, Centers for Disease Control and 
Prevention: Prevalence of autism spectrum disorders - autism and devel- 
opmental disabilities monitoring network, 14 sites, United States, 2008. 
MMWR Surveill Summ 2012, 61:1-19. 

2. Hallmayer J, Cleveland S, Torres A, Phillips J, Cohen B, Torigoe T, Miller J, 
Fedele A, Collins J, Smith K, Lotspeich L, Croen LA, Ozonoff S, Lajonchere C, 
Grether JK, Risch N: Genetic heritability and shared environmental factors 
among twin pairs with autism. Arch Gen Psychiatry 201 1, 68:1095-1 102. 

3. Constantino JN, Zhang Y, Frazier T, Abbacchi AM, Law P: Sibling recurrence 
and the genetic epidemiology of autism. Am J Psychiatry 2010, 
167:1349-1356. 

4. Ozonoff S, Young GS, Carter A, Messinger D, Yirmiya N, Zwaigenbaum L, Bryson 
S, Carver U, Constantino JN, Dobkins K, Hutman T, Iverson JM, Landa R, Rogers 
SJ, Sigman M, Stone WL: Recurrence risk for autism spectrum disorders: a baby 
siblings research consortium study. Pediatrics 201 1 , 1 28:e488-e495. 

5. Berg JM, Geschwind DH: Autism genetics: searching for specificity and 
convergence. Genome Biol 2012, 13:247. 

6. Abrahams BS, Geschwind DH: Advances in autism genetics: on the 
threshold of a new neurobiology. Nat Rev Genet 2008, 9:341-355. 

7. lossifov I, Ronemus M, Levy D, Wang Z, Hakker I, Rosenbaum J, Yamrom B, 
Lee YH, Narzisi G, Leotta A, Kendall J, Grabowska E, Ma B, Marks S, Rodgers 
L, Stepansky A, Troge J, Andrews P, Bekritsky M, Pradhan K, Ghiban E, 
Kramer M, Parla J, Demeter R, Fulton LL, Fulton RS, Magrini VJ, Ye K, Darnell 
JC, Darnell RB, et al: De novo gene disruptions in children on the autistic 
spectrum. Neuron 2012, 74:285-299. 

8. Levy D, Ronemus M, Yamrom B, Lee YH, Leotta A, Kendall J, Marks S, 
Lakshmi B, Pai D, Ye K, Buja A, Krieger A, Yoon S, Troge J, Rodgers L, lossifov 
I, Wigler M: Rare de novo and transmitted copy-number variation in 
autistic spectrum disorders. Neuron 201 1, 70:886-897. 

9. Neale BM, Kou Y, Liu L, Ma'ayan A, Samocha KE, Sabo A, Lin CF, Stevens C, 
Wang LS, Makarov V, Polak P, Yoon S, Maguire J, Crawford EL, Campbell NG, 
Geller ET, Valladares O, Schafer C, Liu H, Zhao T, Cai G, Lihm J, Dannenfelser 
R, Jabado O, Peralta Z, Nagaswamy U, Muzny D, Reid JG, Newsham I, Wu Y, 
et al: Patterns and rates of exonic de novo mutations in autism spectrum 
disorders. Nature 2012, 485:242-245. 



10. O'Roak BJ, Vives L, Girirajan S, Karakoc E, Krumm N, Coe BP, Levy R, Ko A, 
Lee C, Smith JD, Turner EH, Stanaway IB, Vernot B, Malig M, Baker C, 
Reilly B, Akey JM, Borenstein E, Rieder MJ, Nickerson DA, Bernier R, 
Shendure J, Eichler EE: Sporadic autism exomes reveal a highly 
interconnected protein network of de novo mutations. Nature 2012, 
485:246-250. 

11. Sanders SJ, Ercan-Sencicek AG, Hus V, Luo R, Murtha MT, Moreno-De-Luca 
D, Chu SH, Moreau MP, Gupta AR, Thomson SA, Mason CE, Bilguvar K, 
Celestino-Soper PB, Choi M, Crawford EL, Davis L, Wright NR, Dhodapkar 
RM, DiCola M, DiLullo NM, Fernandez TV, Fielding-Singh V, Fishman DO, 
Frahm S, Garagaloyan R, Goh GS, Kammela S, Klei L, Lowe JK, Lund SC, et al: 
Multiple recurrent de novo CNVs, including duplications of the 7q1 1.23 
Williams syndrome region, are strongly associated with autism. Neuron 
2011,70:863-885. 

12. Sanders SJ, Murtha MT, Gupta AR, Murdoch JD, Raubeson MJ, Willsey AJ, 
Ercan-Sencicek AG, DiLullo NM, Parikshak NN, Stein JL, Walker MF, Ober GT, 
Teran NA, Song Y, El-Fishawy P, Murtha RC, Choi M, Overton JD, Bjornson 
RD, Carriero NJ, Meyer KA, Bilguvar K, Mane SM, Sestan N, Lifton RP, Gunel 
M, Roeder K, Geschwind DH, Devlin B, State MW: De novo mutations 
revealed by whole-exome sequencing are strongly associated with 
autism. Nature 2012, 485:237-241. 

13. Sebat J, Lakshmi B, Malhotra D, Troge J, Lese-Martin C, Walsh T, Yamrom B, 
Yoon S, Krasnitz A, Kendall J, Leotta A, Pai D, Zhang R, Lee YH, Hicks J, 
Spence SJ, Lee AT, Puura K, Lehtimaki T, Ledbetter D, Gregersen PK, Bregman J, 
Sutcliffe JS, Jobanputra V, Chung W, Warburton D, King MC, Skuse D, 
Geschwind DH, Gilliam TC, et al: Strong association of de novo copy 
number mutations with autism. Science 2007, 316:445-449. 

14. Klei L, Sanders SJ, Murtha MT, Hus V, Lowe JK, Willsey AJ, Moreno-De-Luca 
D, Yu TW, Fombonne E, Geschwind D, Grice DE, Ledbetter DH, Lord C, Mane 
SM, Martin CL, Martin DM, Morrow EM, Walsh CA, Melhem NM, Chaste P, 
Sutcliffe JS, State MW, Cook EH Jr, Roeder K, Devlin B: Common genetic 
variants, acting additively, are a major source of risk for autism. Molecular 
Autism 2012, 3:9. 

15. Folstein SE, Piven J: Etiology of autism: genetic influences. Pediatrics 1991, 
87:767-773. 

16. Piven J, Palmer P, Jacobi D, Childress D, Arndt S: Broader autism 
phenotype: evidence from a family history study of multiple-incidence 
autism families. Am J Psychiatry 1997, 154:185-190. 

17. Bailey A, Le Couteur A, Gottesman I, Bolton P, Simonoff E, Yuzda E, Rutter 
M: Autism as a strongly genetic disorder: evidence from a British twin 
study. Psychol Med 1995, 25:63-77. 

18. Constantino JN, Todd RD: Intergenerational transmission of subthreshold 
autistic traits in the general population. Biol Psychiatry 2005, 57:655-660. 

19. Fombonne E: Epidemiology of pervasive developmental disorders. Pediatr 
Res 2009, 65:591-598. 

20. Werling DM, Geschwind DH: Sex differences in autism spectrum 
disorders. Curr Opin Neurol 2013, 26:146-153. 

21 . Gecz J, Shoubridge C, Corbett M: The genetic landscape of intellectual 
disability arising from chromosome X. Trends in Genet: TIG 2009, 25:308-316. 

22. Bailey DB Jr, Mesibov GB, Hatton DD, Clark RD, Roberts JE, Mayhew L: 
Autistic behavior in young boys with fragile X syndrome. J Autism Dev 
Disord 1998, 28:499-508. 

23. Jamain S, Quach H, Betancur C, Rastam M, Colineaux C, Gillberg IC, 
Soderstrom H, Giros B, Leboyer M, Gillberg C, Bourgeron T: Mutations of the 
X-linked genes encoding neuroligins NLGN3 and NLGN4 are associated 
with autism. Nat Genet 2003, 34:27-29. 

24. Sato D, Lionel AC, Leblond CS, Prasad A, Pinto D, Walker S, O'Connor I, 
Russell C, Drmic IE, Hamdan FF, Michaud JL, Endris V, Roeth R, Delorme R, 
Huguet G, Leboyer M, Rastam M, Gillberg C, Lathrop M, Stavropoulos DJ, 
Anagnostou E, Weksberg R, Fombonne E, Zwaigenbaum L, Fernandez BA, 
Roberts W, Rappold GA, Marshall CR, Bourgeron T, Szatmari P, et al: SHANK1 
deletions in males with autism spectrum disorder. Am J Hum Genet 2012, 
90:879-887. 

25. Cantor RM, Kono N, Duvall JA, Alvarez-Retuerto A, Stone JL, Alarcon M, 
Nelson SF, Geschwind DH: Replication of autism linkage: fine-mapping 
peak at 1 7q21 . Am J Hum Genet 2005, 76:1 050-1 056. 

26. Liu J, Nyholt DR, Magnussen P, Parano E, Pavone P, Geschwind D, Lord C, 
Iversen P, Hoh J, Ott J, Gilliam TC: A genome-wide screen for autism 
susceptibility loci. Am J Hum Genet 2001, 69:327-340. 

27. McCauley JL, Li C, Jiang L, Olson LM, Crockett G, Gainer K, Folstein SE, 
Haines JL, Sutcliffe JS: Genome-wide and Ordered-Subset linkage analyses 



Werling et al. Molecular Autism 2014, 5:13 
http://www.molecularautism.eom/content/5/1/13 



Page 15 of 16 



provide support for autism loci on 17q and 19p with evidence of 
phenotypic and interlocus genetic correlates. BMC Med Genet 2005, 6:1. 

28. Stone JL, Merriman B, Cantor RM, Yonan AL, Gilliam TC, Geschwind DH, 
Nelson SF: Evidence for sex-specific risk alleles in autism spectrum 
disorder. Am J Hum Genet 2004, 75:1 1 1 7-1 1 23. 

29. Schellenberg GD, Dawson G, Sung YJ, Estes A, Munson J, Rosenthal E, 
Rothstein J, Flodman P, Smith M, Coon H, Leong L, Yu CE, Stodgell C, 
Rodier PM, Spence MA, Minshew N, McMahon WM, Wijsman EM: Evidence 
for multiple loci from a genome scan of autism kindreds. Mol Psychiatry 
2006,1979:1049-1060. 

30. Lamb JA, Barnby G, Bonora E, Sykes N, Bacchelli E, Blasi F, Maestrini E, 
Broxholme J, Tzenova J, Weeks D, Bailey AJ, Monaco AP: Analysis of 
IMGSAC autism susceptibility loci: evidence for sex limited and parent of 
origin specific effects. J Med Genet 2005, 42:132-137. 

31. Barrett S, Beck JC, Bernier R, Bisson E, Braun TA, Casavant TL, Childress D, 
Folstein SE, Garcia M, Gardiner MB, Gilman S, Haines JL, Hopkins K, Landa R, 
Meyer NH, Mullane JA, Nishimura DY, Palmer P, Piven J, Purdy J, Santangelo 
SL, Searby C, Sheffield V, Singleton J, Slager S, Struchen T, Svenson S, 
Vieland V, Wang K, Winklosky B: An autosomal genomic screen for autism. 
Collaborative linkage study of autism. Am J Med Genet 1999, 88:609-615. 

32. Szatmari P, Paterson AD, Zwaigenbaum L, Roberts W, Brian J, Liu XQ, 
Vincent JB, Skaug JL, Thompson AP, Senman L, Feuk L, Qian C, Bryson SE, 
Jones MB, Marshall CR, Scherer SW, Vieland VJ, Bartlett C, Mangin LV, 
Goedken R, Segre A, Pericak-Vance MA, Cuccaro ML, Gilbert JR, Wright HH, 
Abramson RK, Betancur C, Bourgeron T, Gillberg C, Leboyer M, et al: 
Mapping autism risk loci using genetic linkage and chromosomal 
rearrangements. Nat Genet 2007, 39:319-328. 

33. Weiss LA, Arking DE, Daly MJ, Chakravarti A: A genome-wide linkage and 
association scan reveals novel loci for autism. Nature 2009, 461:802-808. 

34. Yonan AL, Alarcon M, Cheng R, Magnusson PK, Spence SJ, Palmer AA, 
Grunn A, Juo SH, Terwilliger JD, Liu J, Cantor RM, Geschwind DH, Gilliam TC: 
A genome-wide screen of 345 families for autism-susceptibility loci. Am J 
Hum Genet 2003, 73:886-897. 

35. Wang K, Zhang H, Ma D, Bucan M, Glessner JT, Abrahams BS, Salyakina D, 
Imielinski M, Bradfield JP, Sleiman PM, Kim CE, Hou C, Frackelton E, Chiavacci R, 
Takahashi N, Sakurai T, Rappaport E, Lajonchere CM, Munson J, Estes A, 
Korvatska O, Piven J, Sonnenblick LI, Alvarez Retuerto Al, Herman El, Dong H, 
Hutman T, Sigman M, Ozonoff S, Klin A, et a I: Common genetic variants on 
5p14.1 associate with autism spectrum disorders. Nature 2009, 459:528-533. 

36. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Mailer J, 
Sklar P, de Bakker PI, Daly MJ, Sham PC: PLINK: a tool set for whole- 
genome association and population-based linkage analyses. Am J Hum 
Genet 2007,81:559-575. 

37. Anney R, Klei L, Pinto D, Regan R, Conroy J, Magalhaes TR, Correia C, 
Abrahams BS, Sykes N, Pagnamenta AT, Almeida J, Bacchelli E, Bailey AJ, 
Baird G, Battaglia A, Berney T, Bolshakova N, Bolte S, Bolton PF, Bourgeron T, 
Brennan S, Brian J, Carson AR, Casallo G, Casey J, Chu SH, Cochrane L, 
Corsello C, Crawford EL, Crossett A, et al: A genome-wide scan for com- 
mon alleles affecting risk for autism. Hum Mol Genet 2010, 19:4072-4082. 

38. Pinto D, Pagnamenta AT, Klei L, Anney R, Merico D, Regan R, Conroy J, 
Magalhaes TR, Correia C, Abrahams BS, Almeida J, Bacchelli E, Bader GD, Bailey 
AJ, Baird G, Battaglia A, Berney T, Bolshakova N, Bolte S, Bolton PF, Bourgeron 
T, Brennan S, Brian J, Bryson SE, Carson AR, Casallo G, Casey J, Chung BH, 
Cochrane L, Corsello C, etal: Functional impact of global rare copy number 
variation in autism spectrum disorders. Nature 2010, 466:368-372. 

39. Kong X, Murphy K, Raj T, He C, White PS, Matise TC: A combined 
linkage-physical map of the human genome. Am J Hum Genet 2004, 
75:1143-1148. 

40. Matise TC, Chen F, Chen W, De La Vega FM, Hansen M, He C, Hyland FC, 
Kennedy GC, Kong X, Murray SS, Ziegle JS, Stewart WC, Buyske S: A 
second-generation combined linkage physical map of the human 
genome. Genome Res 2007, 17:1783-1786. 

41 . Abecasis GR, Cherny SS, Cookson WO, Cardon LR: Merlin-rapid analysis of 
dense genetic maps using sparse gene flow trees. Nat Genet 2002, 30:97-101. 

42. Kong A, Cox NJ: Allele-sharing models: LOD scores and accurate linkage 
tests. Am J Hum Genet 1 997, 61:1 1 79-1 1 88. 

43. Lander E, Kruglyak L: Genetic dissection of complex traits: guidelines for 
interpreting and reporting linkage results. Nat Genet 1995, 11:241-247. 

44. Howie BN, Donnelly P, Marchini J: A flexible and accurate genotype 
imputation method for the next generation of genome-wide association 
studies. PLoS Genet 2009, 5:e1 000529. 



45. Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles 
ME, McVean GA: A map of human genome variation from population-scale 
sequencing. Nature 201 0, 467:1 061 -1 073. 

46. Marchini J, Howie B: Genotype imputation for genome-wide association 
studies. Nat Rev Genet 201 0, 1 1 :499-51 1 . 

47. Kennedy RB, Ovsyannikova IG, Pankratz VS, Haralambieva IH, Vierkant RA, 
Poland GA: Genome-wide analysis of polymorphisms associated with 
cytokine responses in smallpox vaccine recipients. Hum Genet 2012, 
131:1403-1421. 

48. Kiss-Toth E, Bagstaff SM, Sung HY, Jozsa V, Dempsey C, Caunt JC, Oxley KM, 
Wyllie DH, PolgarT, Harte M, O'Neill LA, Qwarnstrom EE, Dower SK: Human 
tribbles, a protein family controlling mitogen-activated protein kinase 
cascades. J Biol Chem 2004, 279:42703-42708. 

49. Du K, Herzig S, Kulkarni RN, Montminy M: TRB3: a tribbles homolog that 
inhibits Akt/PKB activation by insulin in liver. Science 2003, 300:1574-1577. 

50. Kim IG, Lee SC, Lee JH, Yang JM, Chung SI, Steinert PM: Structure and 
organization of the human transglutaminase 3 gene: evolutionary 
relationship to the transglutaminase family. J Invest Dermatol 1994, 
103:137-142. 

51 . Dai Y, Yang Y, MacLeod V, Yue X, Rapraeger AC, Shriver Z, Venkataraman G, 
Sasisekharan R, Sanderson RD: HSulf-1 and HSulf-2 are potent inhibitors of 
myeloma tumor growth in vivo. J Biol Chem 2005, 280:40066-40073. 

52. Glessner JT, Wang K, Cai G, Korvatska O, Kim CE, Wood S, Zhang H, Estes A, 
Brune CW, Bradfield JP, Imielinski M, Frackelton EC, Reichert J, Crawford EL, 
Munson J, Sleiman PM, Chiavacci R, Annaiah K, Thomas K, Hou C, Glaberson 
W, Flory J, Otieno F, Garris M, Soorya L, Klei L, Piven J, Meyer KJ, Anagnostou E, 
Sakurai T, et al: Autism genome-wide copy number variation reveals 
ubiquitin and neuronal genes. Nature 2009, 459:569-573. 

53. Scheuerle A, Wilson K PARK2 copy number aberrations in two children 
presenting with autism spectrum disorder: further support of an association 
and possible evidence for a new microdeletion/microduplication syndrome. 
Am J Med Genet B Neuropsychiatr Genet 201 1 , 1 56B:41 3-420. 

54. Marshall CR, Noor A, Vincent JB, Lionel AC, Feuk L, Skaug J, Shago M, 
Moessner R, Pinto D, Ren Y, Thiruvahindrapduram B, Fiebig A, Schreiber S, 
Friedman J, Ketelaars CE, Vos YJ, Ficicioglu C, Kirkpatrick S, Nicolson R, 
Sloman L, Summers A, Gibbons CA, Teebi A, Chitayat D, Weksberg R, 
Thompson A, Vardy C, Crosbie V, Luscombe S, Baatjes R, et al: Structural 
variation of chromosomes in autism spectrum disorder. Am J Hum Genet 
2008, 82:477-488. 

55. Gronostajski RM, Tolentino-Silva F, Haxhiu MA, Colmenares C, Macklin WB, 
Campbell CE, Butz KG, Duchala CS, Das Neves L: Disruption of the murine 
nuclear factor l-A gene (Nfia) results in perinatal lethality, hydrocephalus, 
and agenesis of the corpus callosum. Proc Natl Acad Sci USA 1999, 
96:11946-11951. 

56. Deneen B, Ho R, Lukaszewicz A, Hochstim CJ, Gronostajski RM, Anderson DJ: 
The transcription factor NFIA controls the onset of gliogenesis in the 
developing spinal cord. Neuron 2006, 52:953-968. 

57. Kieslinger M, Folberth S, Dobreva G, Dorn T, Croci L, Erben R, Consalez GG, 
Grosschedl R: EBF2 regulates osteoblast-dependent differentiation of 
osteoclasts. Dev Cell 2005, 9:757-767. 

58. Nakanishi K, Ida M, Suzuki H, Kitano C, Yamamoto A, Mori N, Araki M, 
Taketani S: Molecular characterization of a transport vesicle protein 
Neurensin-2, a homologue of Neurensin-1, expressed in neural cells. 
Brain Res 2006, 1081:1-8. 

59. Lin JM, Kilman VL, Keegan K, Paddock B, Emery-Le M, Rosbash M, Allada R: 
A role for casein kinase 2alpha in the Drosophila circadian clock. Nature 
2002, 420:816-820. 

60. Huang J, Vieland VJ: Comparison of 'model-free' and 'model-based' 
linkage statistics in the presence of locus heterogeneity: single data set 
and multiple data set applications. Hum Hered 2001, 51:217-225. 

61. Vieland VJ, Wang K, Huang J: Power to detect linkage based on multiple 
sets of data in the presence of locus heterogeneity: comparative 
evaluation of model-based linkage methods for affected sib pair data. 
Hum Hered 2001, 51:199-208. 

62. Yonan AL, Palmer AA, Smith KC, Feldman I, Lee HK, Yonan JM, Fischer SG, 
Pavlidis P, Gilliam TC: Bioinformatic analysis of autism positional 
candidate genes using biological databases and computational gene 
network prediction. Genes Brain Behav 2003, 2:303-320. 

63. Sham PC, Lin MW, Zhao JH, Curtis D: Power comparison of parametric and 
nonparametric linkage tests in small pedigrees. Am J Hum Genet 2000, 
66:1661-1668. 



Werling et al. Molecular Autism 2014, 5:13 
http://www.molecularautism.eom/content/5/1/13 



Page 16 of 16 



64. 



65. 



66. 



67. 



69. 



70. 



72. 



73. 



74. 



75. 



78. 



79. 



Bartlett CW, Goedken R, Vieland VJ: Effects of updating linkage evidence 
across subsets of data: reanalysis of the autism genetic resource 
exchange data set. Am J Hum Genet 2005, 76:688-695. 
Robinson EB, Lichtenstein P, Anckarsater H, Happe F, Ronald A: Examining 
and interpreting the female protective effect against autistic behavior. 
Proc Natl Acad Sci USA 201 3, 1 1 0:5258-5262. 

Brkanac Z, Spencer D, Shendure J, Robertson PD, Matsushita M, Vu T, Bird 
TD, Olson MV, Raskind WH: IFRD1 is a candidate gene for SMNA on 
chromosome 7q22-q23. Am J Hum Genet 2009, 84:692-697. 
Ng SB, Nickerson DA, Bamshad MJ, Shendure J: Massively parallel 
sequencing and rare disease. Hum Mol Genet 2010, 19:R1 19-R124. 
Nikopoulos K, Gilissen C, Hoischen A, van Nouhuys CE, Boonstra FN, 
Blokland EA, Arts P, Wieskamp N, Strom TM, Ayuso C, Tilanus MA, Bouwhuis 
S, Mukhopadhyay A, Scheffer H, Hoefsloot LH, Veltman JA, Cremers FP, 
Collin RW: Next-generation sequencing of a 40 Mb linkage interval 
reveals TSPAN12 mutations in patients with familial exudative 
vitreoretinopathy. Am J Hum Genet 2010, 86:240-247. 
Rehman AU, Morell RJ, Belyantseva IA, Khan SY, Boger ET, Shahzad M, 
Ahmed ZM, Riazuddin S, Khan SN, Riazuddin S, Friedman TB: Targeted 
capture and next-generation sequencing identifies C9orf75, encoding 
taperin, as the mutated gene in nonsyndromic deafness DFNB79. 
Am J Hum Genet 2010, 86:378-388. 

Mikhail FM, Lose EJ, Robin NH, Descartes MD, Rutledge KD, Rutledge SL, 
Korf BR, Carroll AJ: Clinically relevant single gene or intragenic deletions 
encompassing critical neurodevelopmental genes in patients with 
developmental delay, mental retardation, and/or autism spectrum 
disorders. Am J Med Genet A 201 1, 1 55A:2386-2396. 
Zhang K, Lindsberg PJ, TatlisumakT, Kaste M, Olsen HS, Andersson LC: 
Stanniocalcin: a molecular guard of neurons during cerebral ischemia. 
Proc Natl Acad Sci USA 2000, 97:3637-3642. 

Brownlees J, Ackerley S, Grierson AJ, Jacobsen NJ, Shea K, Anderton BH, 
Leigh PN, Shaw CE, Miller CC: Charcot-Marie-Tooth disease neurofilament 
mutations disrupt neurofilament assembly and axonal transport. Hum 

Mol Genet 2002, 11:2837-2844. 

Bouligand J, Ghervan C, Tello JA, Brailly-Tabard S, Salenave S, Chanson P, 
Lombes M, Millar RP, Guiochon-Mantel A, Young J: Isolated familial 
hypogonadotropic hypogonadism and a GNRH1 mutation. N Engl J Med 
2009, 360:2742-2748. 

Stefansson H, Sarginson J, Kong A, Yates P, Steinthorsdottir V, Gudfinnsson 
E, Gunnarsdottir S, Walker N, Petursson H, Crombie C, Ingason A, Gulcher JR, 
Stefansson K, St Clair D: Association of neuregulin 1 with schizophrenia 
confirmed in a Scottish population. Am J Hum Genet 2003, 72:83-87. 
Stefansson H, Sigurdsson E, Steinthorsdottir V, Bjornsdottir S, Sigmundsson 
T, Ghosh S, Brynjolfsson J, Gunnarsdottir S, Ivarsson O, Chou TT, Hjaltason O, 
Birgisdottir B, Jonsson H, Gudnadottir VG, Gudmundsdottir E, Bjornsson A, 
Ingvarsson B, Ingason A, Sigfusson S, Hardardottir H, Harvey RP, Lai D, Zhou 
M, Brunner D, Mutel V, Gonzalo A, Lemke G, Sainz J, Johannesson G, 
Andresson T, et at. Neuregulin 1 and susceptibility to schizophrenia. Am J 
Hum Genet 2002, 71:877-892. 

Vieland VJ: Where's the evidence? Hum Hered 201 1, 71:59-66. 

Vieland VJ: Thermometers: something for statistical geneticists to think 

about. Hum Hered 2006, 61:144-156. 

Barzilai N, Atzmon G, Schechter C, Schaefer EJ, Cupples AL, Lipton R, Cheng 
S, Shuldiner AR: Unique lipoprotein phenotype and genotype associated 
with exceptional longevity. JAMA 2003, 290:2030-2040. 
Yokoyama S, Woods SL, Boyle GM, Aoude LG, MacGregor S, Zismann V, 
Gartside M, Cust AE, Haq R, Harland M, Taylor JC, Duffy DL, Holohan K, 
Dutton-Regester K, Palmer JM, Bonazzi V, Stark MS, Symmons J, Law MH, 
Schmidt C, Lanagan C, O'Connor L, Holland EA, Schmid H, Maskiell JA, 
Jetann J, Ferguson M, Jenkins MA, Kefford RF, Giles GG, etal: A novel recurrent 
mutation in MITF predisposes to familial and sporadic melanoma. Nature 
2011,480:99-103. 



doi:1 0.1 186/2040-2392-5-13 

Cite this article as: Werling et al.: Replication of linkage at chromosome 
20p13 and identification of suggestive sex-differential risk loci for autism 
spectrum disorder. Molecular Autism 2014 5:13. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at 
www.biomedcentral.com/submit 



o 



BioMed Central 



